Information Retrieval Baselines for the ResPubliQA Task
نویسندگان
چکیده
This paper describes the baselines proposed for the ResPubliQA task. These baselines are purely based on information retrieval techniques. The selection of an adequate retrieval model that fit the specific characteristic of the supplied data is considered as a core part of the task. Applying a not adequate retrieval function would return a subset of paragraphs where the answer could not appear, and thus the posterior techniques applied in order to detect the answer within the subset of candidates paragraphs will fail. In order to check the ability to retrieve the right paragraph by a pure information retrieval approach, two baselines are proposed. Both of them use the Okapi-BM25[2] ranking function, with and without a stemming pre-process respectively. The main aim was to prove how well can a pure information retrieval system perform on this task.
منابع مشابه
JU_CSE_TE: System Description QA@CLEF 2010 - ResPubliQA
Abstr act. The article presents the experiments carried out as part of the participation in the Paragraph Selection (PS) Task and Answer Selection (AS) Task of QA@CLEF 2010 – ResPubliQA. Our System use Apache Lucene for document retrieval system. All test documents are indexed using Apache Lucene. Stop words are removed from each question and query words are identified to retrieve the most rele...
متن کاملDocument Expansion for Cross-Lingual Passage Retrieval
This article describes the participation of the joint Elhuyar-IXA group in the ResPubliQA exercise at QA&CLEF 2010. In particular, we participated in the English–English monolingual task and in the Basque– English cross-lingual one. Our focus was threefold: (1) to check to what extent information retrieval (IR) can achieve good results in passage retrieval without question analysis and answer v...
متن کاملTemporal Information Needs in ResPubliQA: an Attempt to Improve Accuracy. The UC3M Participation at CLEF 2010
The UC3M team participates in 2010 in the second ResPubliQA evaluation campaign taking part in the monolingual Spanish task. On this occasion we have completely redesigned our Question Answering system, product of multiple efforts while being part of the MIRACLE team, by creating a whole new architecture. The aim was to gain in modularity, flexibility and evaluation capabilities that previous v...
متن کاملOverview of ResPubliQA 2009: Question Answering Evaluation over European Legislation
This paper describes the first round of ResPubliQA, a Question Answering (QA) evaluation task over European legislation, proposed at the Cross Language Evaluation Forum (CLEF) 2009. The exercise consists of extracting a relevant paragraph of text that satisfies completely the information need expressed by a natural language question. The general goals of this exercise are (i) to study if the cu...
متن کاملThe LogAnswer Project at ResPubliQA 2010
The LogAnswer project investigates the potential of deep linguistic processing and logical reasoning for question answering. The paragraph selection task of ResPubliQA 2010 offered the opportunity to validate improvements of the LogAnswer QA system that reflect our experience from ResPubliQA 2009. Another objective was to demonstrate the benefit of QA technologies over a pure IR approach. Two r...
متن کامل